PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts Using Transfer Learning

Abstract

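The challenge of recognizing named entities in a given text has been a very dynamic field in recent years. This is due to the advances in neural network architectures, the increase in computing power, and the availability of diverse labeled datasets, which deliver pre-trained, highly accurate models. These tasks are generally focused on tagging common entities, but domain-specific use-cases require tagging custom entities which are not part of the pre-trained models. This can be solved by either fine-tuning the pre-trained models, or by training custom models. The main challenge lies in obtaining reliable labeled training and test datasets, and manual labeling would be a highly tedious task. In this paper we present PharmKE, a text analysis platform focused on the pharmaceutical domain, which applies deep learning through several stages for thorough semantic analysis of pharmaceutical articles. It performs text classification using state-of-the-art transfer learning models, and thoroughly integrates the results obtained through a proposed methodology. The methodology is used to create accurately labeled training and test datasets, which are then used to train models for custom entity labeling tasks, centered on the pharmaceutical domain. The obtained results are compared to the fine-tuned BERT and BioBERT models trained on the same dataset. Additionally, the PharmKE platform integrates the results obtained from named entity recognition tasks to resolve co-references of entities and analyze the semantic relations in every sentence, thus setting up a baseline for additional text analysis tasks, such as question answering and fact extraction. The recognized entities are also used to expand the knowledge graph generated by DBpedia Spotlight for a given pharmaceutical text.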

Similar resources

Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents

This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share the same dynamics, reward and action space. In other words, the agents are assumed t...

Transfer Learning Based Cross-lingual Knowledge Extraction for Wikipedia

Wikipedia infoboxes are a valuable source of structured knowledge for global knowledge sharing. However, infobox information is very incomplete and imbalanced among the Wikipedias in different languages. It is a promising but challenging problem to utilize the rich structured knowledge from a source language Wikipedia to help complete the missing infoboxes for a target language. In this paper, ...

Active Learning for Crowdsourcing Using Knowledge Transfer

This paper studies the active learning problem in crowdsourcing settings, where multiple imperfect annotators with varying levels of expertise are available for labeling the data in a given task. Annotations collected from these labelers may be noisy and unreliable, and the quality of labeled data needs to be maintained for data mining tasks. Previous solutions have attempted to estimate indivi...

Knowledge Extraction From Texts By Sintesi

In this paper we present SINTESI, a system for knowledge extraction from Italian inputs, currently under development in our research centre. It is used on short descriptive diagnostic texts, in order to summarise their technical content and to build a knowledge base on faults. Often in these texts complex linguistic constructions like conjunctions, negations, ellipsis and anaphorae are inv...

Rule Extraction for Transfer Learning

Typically rule extraction is done for the purposes of human interpretation. However, there are other possible applications of rule extraction. One practical application is transfer learning, in which knowledge learned in one task is used to aid in learning a related task. The extracted rules, which explain the learned solution to the first task, can be considered advice on how to approach the s...

Journal

Journal title: Computers

Year: 2023

ISSN: 2073-431X

DOI: https://doi.org/10.3390/computers12010017